Word frequency–rank relationship in tagged texts

نویسندگان

چکیده

We analyze the frequency–rank relationship in sub-vocabularies corresponding to three different grammatical classes (nouns, verbs, and others) a collection of literary works English, whose words have been automatically tagged according their role. Comparing with null hypothesis which assumes that belonging each class are uniformly distributed across frequency–ranked vocabulary whole work, we disclose statistically significant differences between classes. This results point fact relationships may reflect linguistic features associated function.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Use of the Word “‘Ishq” in Religious Texts

Some Muslim thinkers have had disagreements over the use of the word “‘ishq” in religious texts since many years ago. The main claim made by those who prohibit the use of this word for the sacred associations such as God, the Prophet (s), and Imāms is that the meaning of this word is appropriate only to the material and carnal issues and so, it has not been used in religious texts and its conce...

متن کامل

“Word Sense Disambiguation: Importance of human familiarity with texts to be sense-tagged”

In this paper a research on the importance of human familiarity with texts to be sense-tagged in the field of Word Sense Disambiguation is presented. This research investigates whether people who are more familiar with English texts about physics, better disambiguate words in such a text, than people who are not that familiar with such a text. For the population considered in this study became ...

متن کامل

Exploiting Parallel Texts to Produce a Multilingual Sense Tagged Corpus for Word Sense Disambiguation

We describe an approach to the automatic creation of a sense tagged corpus intended to train a word sense disambiguation (WSD) system for English-Portuguese machine translation. The approach uses parallel corpora, translation dictionaries and a set of straightforward heuristics. In an evaluation with nine corpora containing 10 ambiguous verbs, the approach achieved an average precision of 94%, ...

متن کامل

Flexible Diagram Generation from Tagged Texts

This paper introduces a novel diagram generation scheme for presenting the inner structures of a text. We first describe how semantic structures of a text can be translated into a diagram: tagging to a text, followed by translation to a diagram. The generated diagrams are tightly linked to the texts, and the media complex of the text and the diagram effectively explains the contents of a docume...

متن کامل

Identifying Word Translations in Non-Parallel Texts

Common algorithms for sentence and word-alignment allow the automatic identification of word translations from parallel texts. This study suggests that the identification of word translations should also be possible with non-parallel and even unrelated texts. The method proposed is based on the assumption that there is a correlation between the patterns of word cooccurrences in texts of differe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Physica D: Nonlinear Phenomena

سال: 2021

ISSN: ['1872-8022', '0167-2789']

DOI: https://doi.org/10.1016/j.physa.2021.126020